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Attorney Docket No. 24460 
Serial No. 09/718, 425 

AMENDMENTS TO THE SPECIFICATION 

Please amend the Specification as indicated in the replacement 
paragraphs below. 

Following the paragraph ending at page 8, line 13, insert the 
following new paragraphs: 

The terms "Zif268" and "SEQ. ID NO. 1" are used 
interchangeably herein. 

The terms "FSD-1" and "SEQ. ID NO. 2" are used interchangeably 
herein. 

The terms "sequence A" and "SEQ. ID NO. 3" are used 
interchangeably herein. 

The terms "sequence B" and "SEQ. ID NO. 4" are used 
interchangeably herein. 

The terms "Gpl" and "SEQ. ID NO. 5" are used interchangeably 
herein. 

The terms "sequence C" and "SEQ. ID NO. 6" are used 
interchangeably herein. 

The terms "sequence D" and "SEQ. ID NO. 7" are used 
interchangeably herein. 

The terms "G31-c3b4" and "SEQ. ID NO. 8" are used 
interchangeably herein. 
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Please replace the paragraph at page 9, lines 3-12, with the 

following: 

According to the method of the invention, a reduced virtual 
representation is first constructed for the predefined 3D 
structure. The reduced representation is may be obtained by the 
methodology originally developed by Herzyk and Hubbard for use with 
dynamic simulated annealing [Herzyk P. and Hubbard R.E. Proteins 
17:310-324 (1993)]. According to this methodology, the amino acids 
are represented by virtual spherical atoms, wherein the main chain 
of the protein, polypeptide or any other suitable polymer is 
represented by one virtual atom per residue located at the Ca 
position and the side chains are represented by one or more 
additional virtual atoms. The number of additional virtual atoms 
depends on the size and chemical composition of the specific side 
chain . 

Following the paragraph ending at page 9, line 12, insert the 
following new paragraph: 

In the 'reduced representation' disclosed in the Herzyk et al 
reference, each amino acid residue is represented by one or more 
spheres, and at least one amino acid is represented by two or more 
spheres. The virtual bonds and excluded volume of these atoms were 
parameterized by analysis of 83 protein structures determined by 
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x-ray crystallography. The authors used this representation in 
restraint satisfaction calculation with dynamic simulated annealing 
in the context of NOE distance constraints. They compared the 
results obtained with this reduced representation to previous 
results obtained using an all-atom representation for the 
determination of the structure of crambin, echistatin and protein 
G from their experimental NOE data. They found that their ' reduced 
representation' permitted a 30-fold decrease in computer time for 
generating a single structural solution, and a 20-fold decrease in 
computer time to produce an acceptable structure, compared to the 
use of an all-atom model. The RMS distances between the reduced 
representation model and the all atom model were 1.5A to 1.7A. 

Please replace the paragraph at page 17, line 24 to page 18, 
line 10, with the following: 

In order to examine examined the method according to the 
invention the (3pa motif typified by the zinc finger DNA binding 
module in the zinc finger protein, Zif268 was used. Zif268 is a 
well recognized protein which has the sequence shown in SEP. ID NO. 
1. This protein is small enough to be both computationally and 
experimentally tractable, yet large enough to form an independently 
folded structure in the absence of disulfide bonds or metal 
binding. Although this motif consists of fewer than 30 residues, it 
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does contain sheet, helix and turn structures. By the method and 
system of the invention the entire amino acid sequence: the buried 
core, the solvent exposed surface and the boundary between core and 
surface, except for the Gly27, which was not mutated during the 
simulation, was computed. The input coordinates are those of 
residues 33-60 of the native proteins obtained from the X-ray 
structure coordinate of Zif268 immediate early gene (krox-24) 
complex with an 11 base pair DNA fragment (Protein Data Bank (PDB) 
code: 1ZAA) , as determined at 2 . lA resolution by Pavletich and Pabo 
[Pavletich N.P., Pabo Science 252:809-817 (1991)]. Recently, this 
protein was also analyzed by Dahiyat & Mayo(13). 

Please replace Table 1 and its corresponding legend at page 



18, lines 18-22, with the following: 



Residue No 


5 112 2 




0 5 0 5 


2 nd Struct 0 * 


EEE TT EEE HHHHHHHHHHHH 


SEO.IDNO. 


KPFQCRICMRNFSRSDHLTTHIRTHTGE 


l&f266P> 


SA< 3 > 


eei ibeeeeeeieeeeeieei ieeieee 


D&M's SA (4) 


eeiebeieeeeieeeeeieei ieeieee 



(1) Secondary structure containing Extended sheet (E), Turn (T) or Helix (H); 

(2) SEP. ID NO. 1 corresponds to the Zif268 wild-type sequence written in one letter code; 

(3) Solvent accessibility as determined by the invention, categorized as buried (b), exposed (e) or 
intermediate (i); 

(4) Solvent accessibility as determined by Dahiyat and Mayo(13). 
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Please replace the section of the specification, including 
Table 2A and its corresponding legend, at page 20, line 20 to page 
21, line 12, with the following: 

Tables 2A and 2B present the lowest energy sequences obtained 
in the first and second sets (A and B respectively) , aligned with 
the second zinc finger module of the DNA binding protein Zif268 and 
with D&M designed sequence, FSD-1 , SEP. ID NO. 2 . The coordinates 
used for the FSD-1 P3a motif score evaluations are the experimental 
NMR coordinates (PDB code 1FSD) , which were found by D&M(13). All 
the energy scores in Table 2B were calculated according the method 
of the present invention's reduced representation of amino acids 
and its scoring function. A and B scores were found to be lower 
than both Zif268 score (without considering the His2Cys2 Zn-binding 
interactions which are not included in the scoring function) , and 
the FSD-1 score. The energy score of the most stable sequence, A, 
corresponding to SEP. ID NO 3. is -351 . 8kcal/mol . This score is 
lower than Zif268 score by 111 . 3kcal/mol which is a significant 
difference (not taking tacking into account the Zn interactions) . 
The relative stability of both SEP. ID NP 3 and SEP. ID NO 4 A and 
D sequenc e s in comparison to SEP. ID NO. 2 the FOD 1 sequence , may 
be in part due to the fact SEP. ID NO. 2 the FOD 1 sequence was 
designed with a different scoring function. 
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Table 2A - The most stable sequneece obtained for the Zim finger 
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(1) Secondary structure containing Extended sheet (E), Turn (T) or Helix (H); 

(2) Solvent Accessibility as determined by the invention, categorized as buried (b), exposed (e) or 
intermediate (i) (all other positions are exposed); 

(3) The sequence as designed by D&M (13) ; 

(4) SEP. ID N O. 1 corresponds to wild-type Zif268 sequence written in one letter code; 

(5) Sequence obtained by the present invention using the SA calculation described herein; 

(6) Sequence obtained by the present invention using D&M SA fitted assignments. 



Please replace the section at page 22, line 21 to page 23, 
line 19, with the following: 

3. Positions 21 and 25 of the optimal sequences were 
selected to be Phe or Met (position 21) and Leu (position 25) side 
chains. In the original SEO. ID NO, 1 Zif2C0 , these positions were 
occupied by the zinc binding His residue. These positions are more 
than 80 percent buried. Position 5, which is 100 percent buried, 
was predominantly selected to be Val. The other boundary positions 
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demonstrate the steric constrains on buried residues by packing 
similar side chains to those of the original SEP. ID NO. 1 Zif2G0 
sequence . 

4. In the helix region (residues 15-26) the algorithm placed 
two Leu side chains and one Gin, which are good helix forming 
residues, in SEP. ID NO. 4 sequence D , and one Leu and one Gin in 
SEP. ID NO. 3 sequence A . 

5. In both SEP. ID NO. 3 and SEP. ID NO. 4 A and D 
sequences, position 5 on the exposed sheet surface was selected by 
the algorithm to be Val, which is a very good 3-sheet forming 
residue, and positions 4 and 10 (and 11 only in SEP. ID NO .4 
sequence D) were selected to be Thr, which is also a good p-sheet 
forming residue. 

6. Alignment of the optimal stable SEP. ID NP. 4 sequence 
"W and SEP. ID NP. 1 Zif2G0 indicates that 4 out of 27 residues 
(not including residue 27 that remains Gly throughout the 
simulation) are identical (15%) and 11 are similar (including the 
identical 40.7%). D&M obtained similar values, with 5 identical 
residues (18.5%) and 12 similar (44.4%). 

7 . Alignment of the SEP. ID NP. 4 sequence D and SEP. ID NP. 
2 FOD-1 indicates that 5 out of 27 residues are identical between 
the sequences (18.5%) and 11 are similar (including identical 
40.7%) . 
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7.1 Secondary structure prediction of the designed sequence 
SEQ. ID NO. 3 and SEP. I D NO. 4 Dequence A and D were further 
examined by secondary structure prediction by the SSPAL predictor 
at Sanger Centere [Salamov A. A. and Solovyev V. V. J. Mol . Biol. 
247:11-15 (1995)], which enable to predict the secondary structure 
of a protein according to its primary structure (amino acid 
sequence) . By these programs both SEP. ID NO. 3 and SEP. ID NO. 4 
A and D acquwiiuBS were predicted to have the desired Zinc finger 
motif. Table 3 presents the secondary structure of the native 
protein (Zif268) according to the Protein Data Bank (PDB), and SEP. 
ID NP. 3 and SEP. ID NO. 4 A and D secondary structure prediction, 
according to SSPAL algorithm at Sanger Centre. SEP. ID NO. 3 A- was 
predicted to have one a-helix (designated H) and two p-strands 
(designated E) (the ppa motif) while the predicted secondary 
structure to SEP. ID NO. 4 B contained only one a-helix and one 
[S-strand. 

Please replace Table 3 and its corresponding legend at page 
24, lines 3-5, with the following: 
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Table 3 - Secondary stnnctunre off predicted primary stractmres A amid B. 
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Secondary Structure 



Please replace the paragraph at page 24, line 7 to page 25, 
line 3, with the following: 

The reduced representation of the lowest energy designed SEO. 
ID NO. 3 and SE O. ID NO, 4 sequences A arid D , was expanded to an 
all-atom representation, using the molecular mechanics package 
CHARMM. The input for this experiment was the backbone coordinates 
of the native protein, the new designed residues and the dihedral 
angles of each position along the designed sequence derived from 
the rotamer with the lowest energy score. The number of atoms of 
SEQ. ID NO. 3 an d SEO. ID NO. 4 A and D after expansion to all 
atoms, were 459 and 446, respectively. Energy minimization was 
performed for SEO. ID NO. 3 and SEO. ID NO. 4 A and D's side chains 
as well as to SEO. ID NO. 1 2if2G0 side chains using CHARMM 

10 



Attorney Docket No. 24460 
Serial No. 09/718,425 

forcefield, the SHAKE algorithm [Van Gunsteren W.F. & Berendsen 
H.J.C. Mol. Phys. 34:1311 (1977)], a dielectric constant of e=l and 
a 12A energy cutoff. The minimization included 200 steps of SD 
(Steepest Descent) and then additional 500 steps of ABNR (Adopted 
Basis Newton Raphson) . After minimization, each of the three 
structures were embedded in an 18A water sphere which included 
-1870 water molecules of type TIP3P [Jorgenes W.L. et al. J Chem. 
Phys. 79:926-935 (1983)]. Each of the water-protein systems of SEP. 
ID NO. 3 and SEP. ID NO. 4 A and D were simulated for 500ps at 300K 
(with a 16A energy cutoff) and a sample of 2000 conformations was 
collected from the resulting molecular dynamics trajectory. It was 
found that the secondary structure of the proteins was maintained 
during the molecular dynamic simulations. 



Please replace Table 4, at page 28, lines 24 1-5, with the 
following: 
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Please replace the paragraph at page 28, lines 12-17, with the 

following: 

2. In most sequences Thr25 was mutated to Leu, which has a 
better helix propensity. Examining of SEP. ID NO. 6 and SEP, ID NO. 
1 sequences C and D by SSPAL predictor at Sanger Centre (as 
described hereinbefore) and by the PHD predictor at EMBL [Rost B 
and Sander C. J. Mol. Biol. 232:584-599 (1993)] showed that 
mutations T25L and V29L maintained the secondary structure of the 
a-helix. 

Please replace the paragraph at page 29, line 11 to page 30, 
line 7, with the following: 

1.4 Minimization of the redesigned sequences in comparison to 
native G(31 of SEP. ID NP. 5 

The reduced representation of the designed SEP. ID NO. 6 
sequence C, which was the sequence with the lowest energy score in 
the first set of simulations, where the conformation of 48 
non-mutated residues was kept fixed, was expanded to its all-atom 
representation, using CHARMM [Brooks B.R. et al. (1983) ibid.], in 
the same manner as described hereinbefore. In general, the 
information provided as input was the backbone coordinates of the 
native protein, the new residues and the rotamer dihedral angles at 
each position along the chain. The number of atoms in SEP. ID NO. 
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6. sequence — 6 was 865. Energy minimization was performed for the 
side chains of this sequence as well as for G31 f s side chains using 
CHARMM force-field [Mackerel A.D. et. Al. (1998) ibid.], a 
dielectric constant e=l and a 16A energy cutoff. The minimization 
included 800 steps of SD and then additional 1100 steps of ABNR. 
The average energies of the sequences after minimization were:- 

SEO. ID NO. 5 efti: -804.2 kcal/mol 
SEO. ID NO. 6 [[C]]: -822.7 kcal/mol 

The above results suggest that the mutations obtained by the 
methodology disclosed herein are tolerable which may lead to the 
designed of a more stable protein. 

The difference between the energies of the native sequence 
SEO. ID NO. 5 SfH: and SEO. ID NO. 6 sequence — B, based on the 
present invention's scoring function and on CHARMM ! s force-field 
(after dynamics) were 7% and 16% respectively, which strengthens 
the conclusion that the method and system of the present invention 
provide a reliable tool for designing de novo proteins. 

The above statement is further strengthened in light of Figure 

7, which show a comparison of the 3D structure of SEO. ID NO. 5 GfH: 
and SEO. ID NO. 6 sequence C . 
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